JKimmo: A Multilingual Computational Morphology Framework for PC-KIMMO
Abstract
Morphological analysis is of fundamental interest in computational linguistics and language processing. Established morphological analyzers with localized interfaces exist mostly for Western languages and a few others; the same cannot be said for Indic and other less-studied languages, for which language processing is just beginning. There are three primary obstacles to computational morphological analysis of these less-studied languages: the generative rules that define the language's morphology, the morphological processor, and the computational interface that a linguist can use to experiment with the generative rules. In this paper, we present JKimmo, a multilingual, open-source morphological framework that uses the PC-KIMMO two-level morphological processor and provides a localized interface for Bangla morphological analysis. We then apply JKimmo to Bangla computational morphology, demonstrating both its recognition and generation capabilities. JKimmo's internationalization (i18n) framework also allows easy localization to other languages, using a property file for the interface definitions and a transliteration scheme for the analysis.
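The two localization mechanisms named in the abstract, a property file for interface labels and a transliteration scheme for the analysis, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the property keys, the four-character mapping table, and the function names are hypothetical and are not taken from JKimmo. The sketch also assumes that the purpose of transliteration is to hand the processor an ASCII form of the Bangla input and to map results back.

```python
# Hypothetical sketch; none of these names, keys, or mappings come from JKimmo.

def parse_properties(text):
    """Parse simple 'key=value' lines, skipping blank lines and '#' comments."""
    labels = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        labels[key.strip()] = value.strip()
    return labels

# Toy one-to-one table (assumed): Bangla KA, LA, MA, and vowel sign AA.
# A real scheme must also handle inherent vowels, conjuncts, and so on.
TRANSLIT = {"\u0995": "k", "\u09b2": "l", "\u09ae": "m", "\u09be": "a"}
REVERSE = {roman: bangla for bangla, roman in TRANSLIT.items()}

def to_roman(word):
    """Map each Bangla character to its ASCII stand-in; pass others through."""
    return "".join(TRANSLIT.get(ch, ch) for ch in word)

def to_bangla(text):
    """Inverse mapping, valid only because the toy table is one-to-one."""
    return "".join(REVERSE.get(ch, ch) for ch in text)

if __name__ == "__main__":
    ui = parse_properties("# English labels\nbutton.recognize = Recognize\n")
    print(ui["button.recognize"])
    word = "\u09ae\u09be\u09ae\u09be"  # MA, AA, MA, AA
    print(to_roman(word), to_bangla(to_roman(word)) == word)
```

Under this design, localizing the interface to another language would mean supplying a second label file with the same keys, while supporting another script would mean supplying a different mapping table.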
Similar resources
Greek Compounds: A challenging case for the parsing techniques of PC-KIMMO v.2
In this paper we describe the recognition process of Greek compound words using the PC-KIMMO software. We try to show certain limitations of the system with respect to the principles of compound formation in Greek. Moreover, we discuss the computational processing of phenomena such as stress and syllabification which are indispensable for the analysis of such constructions and we try to propose...
Generating the Translation Equivalent of Agentive Nouns Using Two-Level Morphology
This paper is about generating translation equivalents of agentive nouns using automatically learned two-level phonological rules. The system is implemented using the PC-KIMMO environment. The basis for the research presented in this paper is two lexicons that contain a list of agentive nouns in Macedonian and English including their components (noun, verb, adjective, pronoun) and ...
Parsing Deficiencies of the PC-KIMMO System
In this paper, we discuss the possibilities and limitations of the PC-KIMMO system as a recognition device of compound formations in a language like Modern Greek, where compounding interacts with derivation, inflection and lexical phonology. We deal with the computational processing of nominal and verbal compounds and try to show certain limitations of the PC-KIMMO software with respect to the p...
Applying Semantic Frame Theory to Automate Natural Language Template Generation From Ontology Statements
Today there exist a growing number of framenet-like resources offering semantic and syntactic phrase specifications that can be exploited by natural language generation systems. In this paper we present on-going work that provides a starting point for exploiting framenet information for multilingual natural language generation. We describe the kind of information offered by modern computational...
Criteria for Computational Models of Morphology: The Two-Level Model as an NLP Framework
Computational models of morphology are best seen not as morphological models but rather as natural language processing frameworks which can express descriptions in the style of one morphological model or the other, and even go further, but without necessarily being bound by “purely” theoretical considerations. Criteria for their adequacy can be derived by treating them (together with the lin...